AMENDMENTS TO THE CLAIMS 



1 . (Cixrrently Amended) An object-based three-dimensional (3-D) audio server system 
comprising: 

an audio input unit receiving object-based sound sources through various input devices; 

an audio editing/producing unit separating the sound sources applied through the audio 
input unit into object sounds and background sounds according to a user's selection, and 
converting them into 3-D audio scene informatio n, wherein the sound sources selected by the 
user are processed into object sounds and other sounds not selected by the user are processed into 
background sounds : and 

an audio encoding unit encoding 3-D information and object signals of the 3-D audio 
scene information converted by the audio editing/producing imit so as to transmit them through a 
medium. 

2. (Cancelled) 

3. (Original) The system according to claim 1, wherein the audio input unit includes: 
a combination of sound source input devices having: 

a single channel microphone with a single microphone; 

a stereo microphone with at least two microphones; 

a dummy head microphone whose shape is like a head of a human body; 

an ambisonic microphone receiving the sound sources after dividing them into 
signals and volume levels, each moving with a given trajectory on 3-D X, Y, and Z 
coordinates; and 

a multi-channel microphone receiving multitrack audio signals; and 
a source separation/3 -D information extractor separating the sound sources apphed from 
the combination of the sound source input devices by objects, and extracting 3-D information. 

4. (Original) The system according to claim 1, wherein the audio editing/producing unit 

includes: 
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a router/audio mixer dividing the sound sources applied in the multi-track format into a 
plurality of sound source objects and background sounds; 

a scene editor/producer editing an audio scene and producing the edited audio scene by 
using 3-D information and spatial information of the sound source objects and background sound 
objects divided by the router/audio mixer; and 

a controller providing a user interface so that the scene editor/producer edits an audio 
scene and produces the edited audio scene under the control of a user. 

5. (Original) The system according to claim 1, wherein the audio encoding xmit includes: 
a data encoding block encoding each set of data divided into background sound objects, 

sound source objects, and audio scene information output from the audio editing/producing unit; 
and 

a multiplexer multiplexing object data of the backgroimd sound, data of the sound 
sources, and data of the audio scene information encoded by the data encoding block into a single 
signal, and transmitting the same. 

6. (Previously Presented) The system according to claim 5, wherein the data encoding block 
includes: 

an audio object encoder encoding the sound objects; 

an audio scene information encoder encoding the audio scene information; and 
a background soimd object encoder encoding the background soimds. 

7. (Currently 7\mended) A method of controlling an object-based 3-D audio server system 

comprising: 

separating sound source objects from among sound sources according to a selection by a 
use r, wherein the sound sources selected by the user are processed into sound source objects and 
other sounds not selected by the user are processed into background sounds : 

inputting 3-D information for each sound source object separated from the applied sound 
sources; 
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mixing sound sources other than the separated sound source objects into background 
sounds; and 

forming the sound source objects, the 3-D information, and the backgroimd sound objects 
into an audio scene, and encoding and multiplexing the audio scene to transmit the encoded and 
multiplexed audio signal through a medium. 

8. (Original) The method according to claim 7, wherein each of the sound source objects 
further includes 3-D information for a relative soimd source object by grouping the sound source 
objects that have to be controlled by groups. 

9. (Original) An object-based three-dimensional audio terminal system comprising: 

an audio decoding unit demultiplexing and decoding a multiplexed audio signal including 
object sotmds, background soxmds, and scene information appHed through a medium; 

an audio scene-synthesizing xmit selectively synthesizing the object sounds with the audio 
scene information decoded by the audio decoding unit into a 3-D audio scene imder the control of 
a user; 

a user control unit providing a user interface so as to selectively synthesize the audio 
scene by the audio scene synthesizing unit under the control of the user; and 

an audio reproducing unit reproducing the 3-D audio scene synthesized by the audio 
scene-synthesizing unit. 

1 0. (Original) The system according to claim 9, wherein the audio decoding unit includes: 
a demultiplexer demultiplexing the data applied through the medium and multiplexed to 

separate them into background sound object data, sound source data, and audio scene information 
data; and 

a decoder decoding the background sound object data, the sound source data, and the 
audio scene information data separated by the demultiplexer. 

1 1 . (Original) The system according to claim 9, wherein the audio scene-synthesizing unit 
includes: 
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a sound source object processor receiving the background sound objects, the sound source 
objects, and the audio scene information decoded by the audio decoding unit to process the sound 
source objects and audio scene information according to a motion, a relative location between the 
sound source objects, and a three-dimensional location of the sound source objects, and spatial 
characteristics under the control of the user; and 

an object mixer mixing the sound source objects processed by the sound source object 
processor with the background sound objects decoded by the audio decoding unit to output 
results. 

12. (Original) The system according to claim 9, wherein the sound source object processor 
further includes: 

a motion processor analyzing a plurality of sound source data and the audio scene 
information, calculating a location of each sound soiu-ce object moving with its particular 
trajectory, and modifying its trajectory under the control of the user through the user control unit; 

a group object processor calculating a relative location of the respective soimd source 
objects when a plurality of the sound source objects is grouped, and controlling the relative 
location of the sound source objects under the control of the user through the user control unit; 

a 3-D sound localization processor providing each sound source object having a location 
defined on 3-D coordinates with directivity in response to a listener's location under the control 
of the user control unit; and 

a 3-D space modeling processor providing a sense of closeness and remoteness and 
spatial effects to each sound source object according to characteristics of a 3-D space. 

13. (Original) The system according to claim 9, wherein the audio reproducing unit includes: 
an acoustic environment equalizer equalizing the acoustic environment between a listener 

and a reproduction system in order to accurately reproduce the 3-D audio transmitted from the 
audio scene synthesizing unit; 

an acoustic environment corrector calculating a coefficient of a filter for the acoustic 
environment equalizer's equalization, and correcting the equalization by the user; and 
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an audio signal output device outputting a 3-D audio signal equalized by the acoustic 
environment equalizer. 

14. (Original) The system according to claim 9, wherein the acoustic environment equalizer 
further includes: 

means for equalizing the enviroimiental characteristics between the listener and the audio 

terminal system in order to accurately reproduce 3-D audio; 

means for canceling crosstalk transmitted to right and left ears of the listener; and 
means for correcting the characteristics of the acoustic environment automatically or in 

response to the user's input, according to the information on speakers of the audio system, a 

listening room's construction, and arrangement of the speakers, transmitted from the acoustic 

environment corrector. 

15. (Original) The system according to claim 9, wherein the user control unit includes an 
interface that controls each sound source object and the listener's direction and position, and 
receives the user's control for maintaining reaUsm of sound reproduction in a virtual space to 
transmit a control signal to each unit. 

16. (Original) A method of controlling an object-based 3-D audio terminal system 
comprising: 

in receiving and outputting an object-based 3-D audio signal, decoding the audio signal 
applied through a medium, and dividing the audio signal into object sounds, 3-D information, 
and backgroimd sounds; 

performing motion processing, group object processing, 3-D sound localization, and 3-D 
space modeling on the object sounds and the 3-D infonnation to modify and apply the processed 
object sounds and 3-D information according to a user's selection, and mixing them with the 
background sounds; and 

equalizing the mixed audio signal in response to correction of characteristics of the 
acoustic environment that the user controls, and outputting the equalized signal. 
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17. (Original) The method according to claim 16, wherein synthesizing the audio scene 
further includes: 

processing a motion effect of each object moving with a particular trajectory, in response 
to a control signal output from a user control unit; 

grouping the object, and calculating and processing a relative location of each grouped 

object; 

processing 3-D sound localization by providing each soimd source object having a 
location defined on 3-D coordinates with directivity in response to a listener's position; 

processing 3-D space modeling by providing the object with a sense of closeness and 
remoteness and spatial effects according to characteristics of a 3-D space; and 

mixing the processed sound source object with the backgrotmd sound object to synthesize 
a 3-D audio scene. 

18. (Original) The method according to claim 16, wherein outputting the audio scene further 
includes: 

equalizing the 3-D audio output according to information on characteristics of the 
acoustic environment between a listener and the audio system, and information on correcting the 
acoustic environment applied by the user; and 

outputting the equahzed 3-D audio scene to provide the same to the listener. 

19. (Original) An object-based three-dimensional audio system comprising: an audio input 
imit receiving object-based sound sources through input devices; 

an audio editing/producing unit separating the sound sources applied through the audio 
input unit into object sounds and background sounds according to a user's selection, and 
converting them into three-dimensional audio objects; 

an audio encoding unit encoding 3-D information of the audio objects and object signals 
converted by the audio editing/producing unit to transmit them through a medium; 

an audio decoding unit receiving the audio signal including object sounds and 3-D 
information encoded by the audio encoding unit through the medium, and decoding the audio 
signal; 



003364 J>128 



7 



10/692,769 



an audio scene synthesizing unit selectively synthesizing the object sounds with 3-D 
information decoded by the audio decoding unit into a 3-D audio scene under the control of a 

user; 

a user control unit outputting a control signal according to the user's selection so as to 
selectively synthesize the audio scene by the audio scene synthesizing unit under the control of 
the user; and 

an audio reproducing unit reproducing the audio scene synthesized by the audio scene 
synthesizing unit. 

20. (Original) A method of controlling an object-based 3-D audio terminal system, 
comprising: 

separating sound source objects from among sound sources according to a selection by a 

user; 

inputting 3-D information on the separated sound source objects; 

processing sound sources other than the input soimd source objects and 3-D information 
as background sounds; 

forming the sound source objects, the 3-D information, and the background sounds into 
an audio scene, and encoding and multiplexing the audio scene to transmit the encoded and 
multiplexed audio scene through a medium; 

decoding the audio signal appUed through a medium, and dividing the audio signal into 
object sounds, 3-D information, and background sounds; performing motion processing, group 
object processing, 3-D sound localization, and 3-D space modeling with respect to the object 
sounds and the 3-D information to modify and apply the processed object sounds and 3-D 
information according to a user's selection, and mixing them with the background sounds; and 

equalizing the mixed audio signal in response to correction of characteristics of the 
acoustic environment that the user controls, and outputting the equalized audio signal. 
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